WO 2005/003344 



1 



PCT/EP2003/006839 



METHOD FOR THE ISOLATION OF EXPRESSED SEQUENCE TAGS IN 
PLANTS 

The present invention relates to a method for the iso- 
lation of Expressed Sequence Tags (ESTs) which are based on 
5 the use of suitable quantities of 5-azaCytidine in the ger- 
mination phase of plants to induce the demethylation phe- 
nomenon of the DNA. In this way, it is possible to obtain 
"totipotent" seedlings, i.e. capable of expressing as many 
genes as possible regardless of the development phase of 
10 the plant and isolating said genes by the synthesis of a 
single cDNA library. 

The term cDNA library means the combination of cDNA 
prepared from the whole population of mRNA of a tissue, a 
cellular line or an organism. It represents the collection 
15 of genes expressed in a cell or in a particular tissue, 
cloned in a suitable vector. 

In order to obtain an overall picture of the genes 
which are expressed in the various development stages of an 
organism, it is necessary to prepare several libraries us- 
20 ing mRNA extracted from the various tissues, or synthesize 
a single EST library using a messenger RNA pool extracted 
from different tissues. 

These systems however involve an enormous amount of 
time, work and money. 
25 The disadvantages of the known art mentioned above can 
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be overcome by means of the method of the present invention 
which is based on the use of suitable quantities of 5- 
azaCytidine in the germination phase of plants to induce 
the DNA demethylation phenomenon. In this way, it is possi- 
5 ble to obtain "totipotent" seedlings after a few weeks, 
i.e. capable of expressing as many genes as possible re- 
gardless of the development phase of the plant and isolat- 
ing said genes by the synthesis of a single cDNA library. 

The DNA methylation process is involved in various 
10 fundamental cellular events, such as, for example, embryo- 
gene development and genetic disorder. Furthermore, this 
process is considered as being one of the most important 
control mechanisms for genomic imprinting. 

It intervenes in fact in the regulation processes of 
15 the gene expression by methylating the sections of DNA cor- 
responding to the genes which must not be expressed in a 
certain tissue development phase. 5-AzaCytidine, a molecule 
which, if present during DNA duplication in the cells, can 
substitute cytosine, is not methylated by methyl trans - 
2 0 f erase and the genome and all the subsequent genomes, deriv- 
ing from the hypomethylated helix remain hypomethylated. 
The inhibition of methyl transferase and hypomethylation of 
the DNA consequently favour a "totipotent" gene expression. 
In accordance with this, the objective of the present 
25 invention relates to a method for the isolation of sequence 
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tags which are expressed in various development phases of 
plants which comprises the following steps: 

(a) germination of seeds in suitable soil in the presence 
of 5-azaCytidine in quantities ranging from 0.1 mM and 

5 2 mM; 

(b) extraction of nucleic acids from the shoots grown as 
indicated in step (a) ; 

(c) synthesis of the cDNA library starting from the nu- 
cleic acids extracted in step (b) ; 

10 (d) sequencing of EST clones and comparison with data 
banks . 

Description of the figures 

Figure 1: this shows the chemical structure of Cytosine (a) 
and 5-AzaCytidine (b) . 

15 Figure 2 : Digestion of the genomic DNA of hard corn, Ofanto 
variety, with restriction enzymes sensitive to methylation 
(Cfol, Hpa2, Mspl) . Samples: Ka = DNA extracted from field 
leaves, the remaining samples are DNA extracted from seed- 
lings germinated in a growth chamber on MSO medium contain - 

2 0 ing 5-Aza-Cytidine at different concentrations: Kb = 0 mM, 
A = 0.1 mM, B = 0.3 mM, C = 0 . 5 mM and D = 1 mM. 
Ml and M2 are 2 molecular weight markers, 1 Kb (Gibco-BRL) 
and MXIV (Roche) respectively. 

Figure 3 : PCR Amplification on cDNA extracted from seed- 
25 lings germinated in a growth chamber on MSO medium contain- 
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ing 5-Aza-Cytidine at different concentrations K = 0 mM, A 
= 0.1 mM r B = 0.3 mM, C = 0.5 mM and D = 1 mM. The abbre- 
viations below indicate the pairs of primers used in the 
PCR reactions. Aend » EST from endosperm; Agli = gene en- 

5 coding a-gliadin; APA = EST from precocious inflorescence 
phases; ThioM = ThioredoxinaM and ThioH = ThioredoxinaH . Ml 
and M2 are 2 molecular weight markers, 1 Kb (Gibco-BRL) and 
MXIV (Roche) respectively. 

Figure 4 : Amplification carried out on 24 phagic plaques 
10 selected at random to verify the size of the insert. K in- 
dicates the vector without an insert, M the molecular 
weight marker l Kb (Gibco-BRli) . 

Figure 5 i Summary graph of FAST As relating to the sequence 
analyses carried out on some clones (385) of the "totipo- 
15 tent" EST libraries of hard corn. The graph indicates the 
representation percentages of each cellular type or tissue 
with which the best homology was found, for each clone. 
Figure 6 : PCR amplification on cDNA extracted from seed- 
lings germinated in a growth chamber on MSO medium contain- 
2 0 ing 5-Aza-Cytidine at different concentrations K = 0 mM, A 
= 0.1 mM, B m 0.3 mM, C = 0.5 mM, D = 1 mM, E = genomic DNA 
of tomato plants, 0 = PCR mix without nucleic acids. Ml and 
M 2 are 2 molecular weight markers, 1Kb (Gibco-BRX.) and 
MXIV (Roche) respectively. 
2 5 Whereas the red arrow indicates the fragment amplified by 
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cDNA, the black arrow indicates the fragment amplified by 
the genomic DNA. 

1 = amplification with the primers specified for the gene 
of pollen LAT59 

5 2 = amplification with the primers specified for the gene 
of anthers LAT52 . 

Table 1 : Summary table of FAST As , relating to the sequence 
analyses carried out on some clones (50) of the EST library 
of hard corn. The table indicates, for each clone, the two 
10 sequences with the best homology. In some cases, a single 
sequence was inserted, if the homology percentage of the 
Subsequent Ones is lower than 60.0%. 
Detailed description of the invention 

The method of the present invention is illustrated 
15 hereunder with reference to hard corn seeds, but can be 
used for any plant . 

According to this method, the germination of the seeds 
is generally effected at a temperature ranging from 20 to 
3 0°C, preferably from 22 to 26° C, in the dark and in the 
2 0 presence of concentrations of 5-azaC ranging from 0.1 mM to 
2 mM, preferably from 0.3 mM to 0.5 mM. 

The phenotype of the seedlings being tested, compared 
with the control phenotype of those cultivated without 5- 
azaC, is strictly correlated to the concentration of 5-azaC 
2 5 used, an increase in the concentration of 5-azaC corre- 
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sponds to a slowing down in the development of the seed- 
lings, with respect to both the aerial part and the roots. 

Approximately 21 days after germination, the DNA, to- 
tal RNA and messenger RNA were extracted from the seed- 
5 lings. 

Analyses for correlating the demethylation degree of 
the DNA with an increase in the undifferentiated gene ex- 
pression were carried out by digestion of the genomic DNA 
with restriction enzymes sensitive to methylation, i.e. en- 

10 zymes which preferentially act on hypomethylated DNA. 

On the basis of the results obtained, a different di- 
gestion of the genomic DNA is observed, in relation to the 
concentrations of 5-azaC adopted. 

Furthermore, to confirm the efficiency of the method 

15 used, several genes were identified which are expressed in 
different growth phases of soft corn, of which a large num- 
ber of sequences are available as this species of corn is 
more widely studied that hard corn. On the basis of the se- 
quences deposited in data banks (National Center for Bio- 

2 0 technology Information, NCBI; European Bioinf ormatics In- 
stitute, EBI) pairs of oligonucleotides identified in the 
zones flanking the region encoding the nucleotidic se- 
quence, were prepared, and subsequently used as primers for 
the amplification of the whole gene being tested. 

25 The name and membership tissue of the genes selected 
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are listed below: 

Aesend, EST from immature endosperm; 

Aesgliad, gene which encodes a-gliadin in the seeds; 
Aespre-Ant, EST which is expressed in precocious in- 
5 florescence phases; 

Thiom, gene which encodes thioredoxinaM in all the 
tissues ; 

Thioh, gene which encodes thioredoxinaH in all the 
tissues . 

10 Each expressed sequence tag was isolated using the po- 

lymerase chain reaction technique (RT-PCR) on the total RNA 
with a pair of oligonucleotides flanking the encoding re- 
gion. 

The cDNA inserts were first amplified with the appro- 
15 priate primers, cloned in the vector p-GEMT and introduced 

into the competent cells Escherichia coli ( E.coli) DH5a. 
The recombinant clones, containing the expected fragments, 
were characterized by restriction analysis and their iden- 
tity was confirmed by effecting sequence reactions carried 
2 0 out using the ABI Prism Big TaqDyeDeoxyTerminator Cycle Se- 
quencing Kit (Applied Biosystems, Nr. 4 3 03149) and analyzed 
with the automatic ABI 377 DNA Sequencer (Perkin Elmer ABI 
Prism) . 

The nucleotidic sequences obtained were compared with 
25 those deposited in public data banks (NCBI, EBI) . An analy- 
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sis of the data confirmed the efficiency of the method, 
demonstrating that the best results are obtained with a 
concentration of 5-AzaCytidine in the germination medium 
equal to 0.3-0.5 rnlVL 
5 A cDNA expression library from plants of Triticum tur- 

gidum germinated in MSO medium containing 0.3 mM of 5- 
AzaCytidine, was subsequently prepared, in the phagic vec- 
tor lambda Uni-ZAP XR. Polyadenylated messenger RNA was 
used for the synthesis of the double-strand cDNA operating 
10 according to the procedures suggested by the kit distribut- 
ing company (STRATAGENE) . 

The molecules of cDNA with a high molecular weight 
used for constructing the library were separated from those 
with a low molecular weight which represent the fraction of 
15 molecules in which the synthesis was not completed. 

The fraction of cDNA corresponding to the high molecu- 
lar weight fraction was inserted in the phagic vector 
lambda Uni-ZAP XR and packed with the packaging extracts 
containing proteins for the head and tail of the phage. The 
2 0 total quantity of phagic particles obtained from the pack- 
aging in vitro was determined by plating small aliquots 
with the host bacterial strain XLl-Blue MRF 1 . 

The dimensions of the inserts, present in the library 
produced, were verified by subjecting various phagic 
25 plaques selected at random to amplification reaction and 
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using a specific pair of primers for the vector Uni-ZAP XR. 

The results obtained showed that the fragments of the 
primary cDNA library have an average dimension ranging from 
0.5 to 1.6 Kb. 

5 For the amplification of the primary library, the 

phages were used to infect the host cells XL1 Blue MRF 1 
which, by allowing the replication of the phages in their 
inside and following their lysis, enabled a library con- 
sisting of about 1 x 10 9 phagic particles per ml, to be re- 
10 covered. 

After amplification, the library in the lambda phage 
was converted to a piasmidic library by means of total ex- 
cision in vivo . The vector Uni-ZAP XR was prepared so as to 
allow an efficient excision in vivo of the inserts cloned 
15 in the lambda vector to form phagemid. 

Excision in vivo depends on particular DNA sequences 
present in the vector and with different proteins, includ- 
ing those proteins deriving from the helper phage, and is 
favoured by SLOR cells which, due to their characteristics, 
2 0 eliminate problems associated with co- infection with the 
helper phage, by inhibiting it. 

SLOR cells of the strain E .coli were therefore trans- 
formed with phagemids, plated on selective medium for 
pBluescriptsk (+/-) phagemid containing ampicillin to form 
25 colonies. The titer of the library in excised phagemid is 
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1.5 x 10 12 colonies per ml. 

The DNA extracted from the colonies was then used for 
sequence analysis. 

The results of the sequencing of EST clones and their 
5 comparison with data banks showed homologous sequences with 
different tissues (leaves, ripe seeds, flowers and roots) 
even though the seedlings were only 21 days old. As can be 
observed in figure 5, in fact, homologies can be seen with 
genes involved in the starch metabolism, genes encoding re- 
10 serve proteins, which are expressed in the roots or inflo- 
rescence phases. As expected for a typical cDNA library, 
20% of the EST clones sequenced did not show any homology 
with sequences whose function is known. 

The following examples are illustrative but do not 
15 limit the scope of the invention described. 

The method of the invention can usually be applied in 
kits for the synthesis of totipotent cDNA libraries. In 
practice, the new kit should also comprise, in addition to 
the kit components currently on the market (as described 
2 0 for example in Table 1 of the handbook for the Library Con- 
struction of STRATAGENE Catalogue Nr. S2 00450 ) , the neces- 
sary components for the embodiment of the method of the in- 
vention, such as: 

a solution of 5-AzaCytidine; 
25 - tomato and/or wheat seeds on which the test has been 
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verified; 

oligonucleotides of genes amplified with the system 
proposed; 

Description of the procedure for the embodiment of the 
5 method of the invention. 

EXAMPLE 1 

Extraction of the genomic DNA and messenger RNA 

Seeds of hard corn ( Triticum durum) were sterilized in 
a solution of ethyl alcohol at 70% for 10 minutes under 
10 stirring. After eliminating the ethanol, the seeds were 
treated with a solution containing sodium hypochlorite 
(ACE) at 50% and Sodium Dodecyl Sulfate (SDS) at 0.5%, in- 
cubated at room temperature for 20 minutes under light 
stirring. 

15 The seeds were subsequently washed with sterile H 2 0 

until the complete removal of the foam (about 7-8 times) 
and about 15-20 seeds were then placed in a Magenta Box 
each containing: 3 0 ml of MSO substrate (Sucrose 15 g/1, 
Muraschige-Sckug MS salts 2.2 g/l, Thiamin 0.2 mg/1, Myo- 

20 inositol 50 mg/1, pH 5.6, agar 7 g/1), different concentra- 
tions of 5-azac (0.1 mM, 0.3 mM, 0.5 mM and 1 mM) and ger- 
minated in a chamber thermostat-regulated at 24 °C in the 
dark. 

Seeds of hard corn left to germinate in MSO medium 
25 without 5-azac, were used as a control. 
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After about 21 days of germination, the DNA, total RNA 
and messenger RNA were extracted from the seedlings using 
the Invitrogen kit (Fasttrack™ 2.0 Kit, Nr. K1593-02, 
K1593-03) . 

5 Analyses for correlating the demethylation degree of 

the DNA with an increase in the undif f erentiated gene ex- 
pression were carried out by digesting the genomic DNA with 
the following restriction enzymes: Cfol, Hpa2, Mspl 
(Roche) , sensitive to methylation. 

10 For each reaction, about 5 ^ig of total DNA were di- 

gested with 20 units of enzyme in a final volume of 150 jj.1, 
for a night at 37*C. Bach digestion product was precipi- 
tated and resuspended in 3 0 xxl of H 2 €>. The digestion mix- 
tures were separated on agarose gel at 0.8% and subjected 

15 to horizontal electrophoresis, for a night at 25 mV (Sam- 
brook, J. et al., 1989, Cold Spring Harbor Laboratory 
Press) . 

In all cases differences were observed with respect to 
the control, which indicate a different methylation of the 
2 0 genome analyzed (Figure 2) . 

Furthermore, in order to confirm the efficiency of the 
method used, some of the genes identified in literature 
which are expressed in different growth phases of soft 
corn, were isolated. 

25 About 3 \xg of total RNA were used for the synthesis of 
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double -strand cDNA using the kit distributed by PHARMACIA 
(catalogue nr.: 27-9260-01). The experimental conditions 
used were those suggested by the distributing company of 
the kit . 

5 A pair of nucleotides flanking the encoding region was 

prepared for each gene. The name, size and pair of primers 
in the direction 5^3' , are listed below: 

• Aesend 58 8 bp; EST from endosperm; Accession number: 
BE401963; 

10 5' GGATCCTTCCAGAGTACCTG 3' (FORWARD, EndFor) 

5 f TCTAGATAGCACTACCTACAAACAC 3 1 (REVERSE, EndRev) . 

• Aesgliad 1Q92 bp; alpha gliadxn gene; Accession number; 
U08287; 

5' GGATCCGGTCAATACAAATCC 3' (FORWARD, GliaFor) 
15 5' AAGCTTCACCGCTACAACGACC 3' (REVERSE, GliaRev) . 

• Aespre-Ant 56 6bp; EST from pre-anthesis spike; Accession 
number: BE500795; 

5' GGATCCAAACGGCGCCCG 3' (FORWARD, PAntSFor) 

5' TCTAGATTTACTGCACTAGGAC 3 1 (REVERSE, PAntSRev) . 

20 • Thiom 781 bp; thioredoxina M; Accession number: AJ005840; 
5' GGATCCCTCCCTCTGTCTCC 3' (FORWARD, ThioMFor) 
5' TCTAGACGAACATGCATGTATACTG 3' (REVERSE, ThioMRev) . 

• Thioh 630 bp; thioredoxina H; Accession number: AJ001903 . 
5' GGATCCCGTGAGAAATAAGCG 3' (FORWARD, ThioHFor) 

2 5 5' T CTAGATG AAATCAAC CATTTAC CG 3' (REVERSE, ThioHRev) . 
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These oligonucleotides were used for the isolation of 
the corresponding fragments by means of the polymerase 
chain reaction technique (PCR) . 

The amplification was carried out in a GeneAmpPCRSys- 
5 tems9700 R thermal cycler (PE AppliedBioSystems) using, for 

each reaction, a mixture (25 |il) containing 6 \il of double - 
strand cDNA, 10 mM Tris-HCl pH 8 . 3 , 50 mM KC1, 1.5 mM 

MgCl 2 , 2.5 |iM of each primer, 0.1 mM of dNTP and 2.5 Units 
of Taq DNA polymerase (Roche) . 
10 After a first denaturation cycle for 5 minutes at 

95°C, the reaction continued with the following cycles: 
1 minute at 34 °C {denaturation) 

1 minute at 56 °C (pairing) 

2 minutes at 72 °C (elongation) 

15 for a total of 35 cycles, followed by 10 minutes at 72 °C 
(final extension) . 

The amplification were separated on agarose gel at 1%; 
the DNA bands of interest were recovered and purified with 
the GeneClean™ kit (BIO 101 Inc., USA, Nr. 1001-400). 

2 0 About 100 ng of the DNA thus isolated, for each ampli- 

fication product, were ligated to 50 ng of pGEM-T plasmid 

(Promega, Nr. A3600) in 10 ^il of reaction mixture, in the 
presence of 2 units of T4 DNA ligase (Promega, Nr. A3600) 
and incubated at 4°C for a night. 
25 2 jil of each mixture were used to transform competent 
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cells of E.coli DH5a (BRL, Nr. 18258-012) . 

The recombinant clones were selected on Petri plates 
of LB solid medium (NaCl 10 g/1, yeast extracts 5 g/1, 
Bacto-triptone 10 g/1 and agar 20 g/1) containing 100 mg/1 
5 of ampicillin. 

6 clones were identified for each transformation 
event, from which the plasmid DNA, adopted for the sequence 
analyses, was extracted. The reactions and sequence analy- 
ses were carried out with the ABI Prism Big TaqDyeDeoxyTer- 
10 minator Cycle Sequencing kit (Applied Biosystems, Nr. 
4303149) , using the GeneAmpPCRSystem9700 R (PE Applied Bio- 
Systems) as thermal cycler and the ABI Prism 3 77 DNA Se- 
quencer (Applied Biosystems) as sequencer. 

The sequence analyses carried out for each gene showed 
15 homology with the sequences of the genes selected, thus 
confirming the efficiency of the method used. 
EXAMPLE 2 

Construction of the cDNA hard corn library 

The messenger RNA was extracted from etiolated seed- 
2 0 lings of hard corn, Ofanto variety, germinated on MSO me- 
dium containing 5-AzaCytidine 0.3 mM, using the Invitrogen 
kit (Fasttrack™) 2.0 Kit, Nr. K1593-02, K1593-03) . 

The experimental conditions suggested by the kit dis- 
tributor (Library Construction Kit, Stratagene, Catalogue 
25 Nr. S200450) were used for the synthesis of the EST (Ex- 
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pressed Sequence Tags) library. 

About 5 \xg of polyadenylated messenger RNA of hard 
corn were used for the synthesis of double- strand cDNA. 

The end of the cDNA molecules were flattened by the 
5 action of DNA polymerase Pfu (5 Units, Stratagene) , con- 
taining 3.6 |ig of linkers having the restriction site EcoRI 
and subjected to digestion with the enzyme Xhol (12 0 Units) 
whose site is present in the polydT primer used for the 
synthesis of the first strand of the cDNA, 
10 This gives rise to molecules having the EcoRI site at 

one end and the Xhol site at the other. 

In order to separate molecules of cDNA with a high mo- 
lecular weight, useful for constructing the library, from 
those having a low molecular weight represented by the 
15 fraction of "molecules in which the synthesis had not been 
completed, the cDNA sample of Triticum durum was passed on 
a Sephacryl R S-500 column, equilibrated in 2 0 mM of Tris- 
HC1 pH 7.5, 10 mM EDTA, 100 mM of NaCl, and subjected to 
centrifugation for 2 minutes at 400 x g. 
2 0 Three fractions of the library were recovered and 

their molecular weight was subsequently verified by the 
separation of an aliquot of each fraction on non- denaturing 
polyacrylamide gel at 5% (Sambrook, J. et al . (1989), Cold 
Spring Harbor Laboratory Press) . 
2 5 About 100 ng of the first fraction of cDNA recovered, 
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corresponding to the high molecular weight fraction, were 

directionally ligated with 1 \ig of the lambda phagic vector 
Uni-ZAP XR, pre -digested with EcoRI and Xhol, and packed 
with the packaging extracts containing proteins for the 
5 head and tail of the phage . 

The total quantity of phagic particles obtained from 
the packing in vitro was determined by plating small ali- 
quots with the host bacterial strain XLl-Blue MRF 1 , i.e. by 
effecting its titer. 
10 The primary library obtained contains a total of 2.3 x 

10 6 units forming plaques (pfu) per p.g of arms of the li- 
gated vector and 97% of these contain the DNA insert. 
The following controls were effected parallelly: 

(1) verification of the ligase efficiency using the vector 
15 pBR322 suitably linearized (supplied with the kit) , as in- 
sert to be ligated to the phagic vector; 

(2) verification of the packaging efficiency in which an 
aliquot of lambda DNA was packed and finally ; 

(3) control for monitoring the library background (non- 
20 recombinant clones) in which a ligase was effected, and 

consequently the packaging of the phagic vector alone. 

The dimension of the library produced was verified by 
subjecting 24 phagic plaques selected at random and ampli- 
fied with a pair of specific primers for the vector Uni-ZAP 
25 XR, to PCR reaction- 
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a) Primer Forward: 5' GTAAAACGACGGCCAGT 3 1 (pBSKFor) ; 

b) Primer Reverse: 5' GGAAACAGCTATGACCATG 3 ! (pBSKRev) . 

The results obtained showed that the inserts of the cDNA 
library had an average dimension of 0.8 Kb (Figure 4) 
5 EXAMPLE 3 

Conversion of the phagic library Uni-ZAP XR to a plasmid 
library 

The primary corn library was subsequently amplified to 
make it more stable and converted into a phagemid, in the 
10 phagemid vector pBluescriptSK of 2958 bp, by means of total 
excision in vivo according to the method described by the 
distributor of the kit adopted (Library Construction Kit, 
Stratagene, Nr. S200450) . 

More specifically, the host cells XL1 Blue MRF 1 were 
15 diluted to OD 60 o 0.5 in 10 mM of MgS0 4 . To amplify 1 x 10 6 
plaques, 20 aliquots of library were used, each aliquot 

contains 5 x 10 4 plaques (11 with which 600 jil of host 

cells were infected, and each aliquot was distributed on 
150 mm plates containing NZY medium (NaCl 5 g/l f MgS0 4 2 
2 0 g/1, yeast extracts 5 g/1, NZ amines, hydrolyzed casein 10 
g/1 and agar 15 g/1, pH 7.5), incubated for 8 hours at 
37°C. 

The following day 10 ml of SM buffer (NaCl 5.8 g/1, 
MgS0 4 2 g/1, Tris-HCl 1M, pH 7.5, 50 ml, gelatin 2% 5 ml) 
2 5 were added to each plate and incubated for the whole night 
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at 4°C under stirring. 

Once the phagic suspension had been recovered, the 
titer of the library was controlled again, and proved to be 
equal to about 7 x 10 9 phagic particles per ml. To preserve 
5 the phagic library, the suspension recovered (40 ml) was 
divided into aliquots and distributed in 1 ml tubes, 0.3% 
of chloroform was added to a part of the aliquots (20) , and 
preserved at 4°C, 7% of DiMethylSulf oxide was added to the 
remaining part (2 0) and preserved at -80°C. 
10 In order to effect the excision of the library, a 

lambda phage / XL1 Blue M RF 1 cells ratio equal to 1:10 was 
used, and a helper phage/ XLl Blue M RF 1 cells ratio equal to 
10:1. 

In practice, 10 0 \xl of the amplified library, corre- 
15 spending to about 1 x 10 s phagic particles, were incubated 
with 1 x 10 9 cells of XL1 Blue M RF 1 , i.e. about 8 ml, and 
with 1 ml of Ex&ssist helper phage, corresponding to 1 x 
10 10 pfu, to generate phagemid particles containing the 
plasmid vector excised from the phagic vector. 
20 The excess number of helper phages and E. coli cells, 

with respect to the number of phages of the library, was 
used to ensure that each cell was infected both by the 
helper phages and by the lambda phage in order to obtain an 
efficient and representative excision in vivo . 
25 The incubation of the lambda phages with the helper 
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phages and XL1 Blue M RF 1 cells took place fro 15 minutes at 
37°C, after which 20 ml of LB medium were added and the in- 
cubation was continued at 37°C for a further 3 hours. 

In order to lyse the phagic particles and allow the 
5 release and recovery of the phagemid particles, the suspen- 
sion was incubated at 70 °C for 2 0 minutes and then centri- 
fuged for 10 minutes at 500 x g. The surnatant was recov- 
ered and preserved at 4°C. 

1 x 10 8 phagemids (1 \xl of the surnatant) were subse- 
10 quently incubated with 2 00 \il of SLOR E .coli cells (ratio 
10:1), at 37°C for 15 minutes, in order to obtain SLOR bac- 
terial cells containing the hard corn EST library in a 
plasmid vector, 

3 tests were then carried out, by plating 100 jj.1, 10 
15 }il and l jil on solid LB medium (NaCl 10 g/1, yeast extracts 
5 g/1, Bacto-triptone 10 g/1 and agar 2 0 g/1) containing 
ampicillin at a concentration of 100 mg/1; the plates were 
incubated at 3 7 °C for 8 hours. 

The titer of the excised phagemid library is 1.5 x 
2 0 10 12 colonies per ml. The phagemid library was subsequently 
multiplied in the strain of E. coli SLOR and the clones ob- 
tained were distributed in plates with 96 cavities. 

The preservation medium is the "Cell Freezer Storage 
Medium" , consisting of LB (NaCl 10 g/1, yeast extracts 5 
25 g/1/ Bacto-triptone 10 g/1) , containing 100 mg/1 of am- 
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picillin and agar 20 g/1; 1 X Freezer Buffer (K 2 HP0 4 62.7 
g/1, KH 2 P0 4 18 g/1, Na Citrate 5 g/1, MgS0 4 1 g/1, (NH 4 ) 2 S0 4 
9 g/1, Glycerol 44 0 ml) . Each colony was transferred to a 
cavity containing 1 ml of the above solution containing 0.1 
5 ml of LB with ampicillin. The plates were incubated for a 
night at 3 7°C and then preserved at -80°C. 
EXAMPLE 4 

EST library screening, sequence analysis and comparison 
with data banks 

10 In order to effect the analysis of the expressed se- 

quence tags (EST) in the hard corn cDNA library, the plas- 
mid DNA was extracted from the plates with 96 cavities and 
subjected to sequence analysis. 

An aliquot (3 jal) taken from each single cavity, con- 
15 taining a clone, was incubated for a night at 37°C, in 1.2 
ml of LB with ampicillin. The purification of the DNA was 
carried out using BIOMECK2 000 (Beckman) , an automatic liq- 
uid handling station, and the extraction kit of Promega, 
(Wizard SV96, Plasmid DNA Purification System Nr. A2255) , 
2 0 effecting all the operations indicated by the manufacturer. 

The quantification of the extracted DNA was then ef- 
fected, charging 5 jjlI on agarose gel at 1%. 

Each clone was subjected to sequence reaction, using 
universal oligonucleotides as primers, which can be found 
2 5 in the polylinker of the pBluescriptsk vector in both di- 
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rections : 

• Primer Forward: 5 1 GTAAAACGACGGCCAGT 3» (pBSKFor) ; 

• Primer Reverse: 5 1 GGAAACAGCTATGACCATG 3 ! (pBSKRev) . 
About 350 ng of plasmid DNA were used for each se- 

5 quence reaction, following the procedures suggested by the 
protocol of the ABI Prism Big TaqDyeDeoxyTerminator Cycle 
Sequencing kit (Applied Biosys terns, Nr, 4 3 03149) . 

The amplifications were carried out using the equip- 
ment for PCR GeneAmp PCR System 9700 (PE-Applied Biosys- 
10 terns) . The reaction products were charged on LongRanger 
Single pack, denaturing polyacrylamide gel, type 377 (Biow- 
hittaker Molecular Applications, Nr. 50691) , and the 
electrophoresis run was carried out using the automatic se- 
quencer ABI 3 77 DNA Sequencer (Applied Biosys terns) . 
15 The sequence analyses were carried out using the Se- 

quencer™ program (Gene Codes Corporation) . 

The data obtained for each clone were compared with 
the sequences deposited in public data banks, such as FASTA 
and BLAST (National Center for Biotechnology Information, 
2 0 NCBI; European Bioinf ormatics Institute, EBI) . 

An example of the results obtained is indicated in Ta- 
ble 1 and in figure 5 . 
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Table 1 



Clone 


In- 
sert 

(bp) 


Accession 
number 


Homology 
\ /o j 


Organism 


Cellular type/ tissue 


cDNA 00010 


875 


BE060569 
AP003436 


82.8 
78.7 


Hordeum vulgare 
Oryza sativa 


Spike before flowering 
Non-specified tissue 


cDNA 0001 2R 


620 


AP003934 
AP003722 


81.0 
81.0 


Oryza sativa 
Oryza sativa 


Non-SDecified tissue 
Non-specified tissue 


cDNA 00013 


354 


A1834373 


62.0 


Zea Mays 


1 Inrinp QntW^ 


cDNA 00014 


759 


BE587421 
AW564255 


95.3 
68.7 


Secale cereale 
Sorghum bi color 


Root end 

QfipH linn nrniA/n in Vho Unhf 


cDNA 00015 


394 


BE405857 
BE637241 


81.9 
85.5 


Triticum aestivum 
Secale cereale 


Root 

AM IU 


CDNA 00016 


904 


BG418106 
BG873973 


89.0 
79.0 


Hordeum vulgare 
Zea mays I 


Head/pericarp 
Young seedling 


cDNA 00017 


915 


BF482801 
BF277210 


98.7 
67.4 


Triticum aestivum 
Gossypium arboreum 


Spike before flowering 
Fibres isolated from capsule 


cDNA 00018 


693 


BG905579 
BE216980 


98.4 
92.2 


Triticum aestivum 
Triticum aestivum 


Leaves 
Leaves 


_p.ii » r\/\f\Jt 

cDNA 00019 


1011 


BG418804 
BE606987 


89.0 
95.9 


Hordeum vulgare 
Triticum aestivum 


Head/pericarp 
Spike 


CDNA 00020R 


598 


X56882 
BE471153 


89.4 
91.1 


Triticum aestivum 
Triticum aestivum 


Embryonal axis 
Young seedling without en- 
oosperm 


cDNA 00021 


724 


BE429737 
BE426779 


94.2 
94.2 


Triticum aestivum 
Triticum aesHvum 


Non-specified tissue 


CDNA00022R 


595 


BG414796 
BG907822 


85.3 
98.17 


Hordeum vulgare 
Triticum aestivum 


Head/pericarp 
Leaves 


CDNA 00024R 


569 


BE366369 
BG1 02688 


81.6 
81.6 


Sorghum bicoior 
Sorghum propinquum 


Leaves with anthracnose 
Rhizomes 


cDNA 00025F 


616 


BE213392 
BE490543 


95.8 
96.4 


Triticum aestivum 
Triticum aestivum 


Leaves 
Young seedlinq 


cDNA 00034F 


628 


BE427302 
BE426779 


87.4 
96.2 


Triticum aestivum 
Triticum aestivum 


Head/pericarp 
Spike before flowering 


cDNA 00046F 


202 


BF200965 
BE488428 


87.5 
81.7 


Triticum aestivum 
Triticum aestivum 


Tissue of seedling crown 
Etiolated shoot 


cDNA 00050F 


532 


BE404845 
BE429889 


92.7 
94.3 


Triticum aestivum 
Triticum aestivum 


Root 

Non-specified tissue 
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EXAMPLE 5 

Verification of the method in tomato plants 

Tomato seeds (Lycopersicon esculentum, cv. Red Setter) 
were sterilized in a solution of ethyl alcohol at 7 0% for 
5 10 minutes under stirring. After eliminating the ethanol, 
the seeds were treated with a solution containing sodium 
hypochlorite (ACE) at 50% and Sodium Dodecyl Sulfate (SDS) 
at 0.5%, incubated at room temperature for 2 0 minutes, un- 
der stirring. 

10 The seeds were subsequently washed with sterile H 2 0 

until the foam had been completely removed (about 7-8 
times) and about 15-2 0 seeds were then positioned in a Ma- 
genta Box, each containing: 3 0 ml of MSO substrate (Sucrose 
15 g/1, MS Muraschigo-Sckug salts 2.2 g/1, Thiamine 0.2 
15 mg/1, Myoinositol 50 mg/1, pH 5.6, agar 7 g/1), different 
concentrations of 5-azac (0.1 mM, 0.3 mM, 0.5 mM and 1 mM) 
and germinated in a dark room thermostat-regulated at 24 °C. 

Tomato seeds left to germinate in MSO medium without 
5-azac, were used as a control. 
2 0 After about 21 days of germination, the total RNA was 

extracted from the etiolated seedlings, using the Invitro- 
gen kit (Fasttrack™ 2.0 Kit, Nr. K1593-02) . 

In order to confirm the efficacy of the method used, 
some of the genes identified in literature, which are ex- 
2 5 pressed in different growth phases of tomato plants, were 
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isolated. 

About 3 \xg of total RNA were used for the synthesis of 
double- strand cDNA using the kit distributed by PHARMACIA 
(catalogue Nr: 27-9260-01) . The experimental conditions 
5 adopted were those suggested by the kit supplier. 

A pair of oligonucleotides flanking the codifying re- 
gion was designed for each gene. The name, size and pair of 

oligonucleotides in the direction 5'->3', are provided be- 
low: 

10 • LAT59 1350bp; gene which is expressed in pollen; Acces- 
sion number: X15499; 

5' CAAAGGAGCCATTGTGGAT 3' (FORWARD, LAT59-For) 
5' GCTGGAGCTGCTGATATTCC 3' (REVERSE, LAT59-Rev) 

• LAT52 4 86bp; gene which is expressed in anthers; Acces- 
15 sion number: X15855; 

5' ATGGCAAAGGCTATTGTGCT 3/ (FORWARD, LAT52-For) 
5 ' CTCTTTGCAGTCCTCCCTTG 3' (REVERSE, LAT52-Rev) . 

These oligonucleotides were used for the isolation of 
the corresponding fragments by means of the polymerase 
2 0 chain reaction (PCR) technique. 

The amplification was effected in a GeneAmpPCRSys- 
tem9700 R thermocycler (PE AppliedBioSys terns) using, for 

each reaction, a mixture (25 \xl) containing 6 \xl of double - 
strand cDNA, 10 mM of Tris-HCl pH 8.3, 50 mM of KCl, 1.5 mM 

25 of MgCl 2 , 2.5 Jim of each primer, 0.1. mM of dNTP and 2.5 
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Units of Taq DNA polymerase (Roche) . 

After a first denaturation cycle at 95 °C for 5 min- 
utes, the reaction continued with the following cycles: 

1 minute at 94 °C (denaturation) , 
5 1 minute at 56°C (pairing) 

2 minutes at 72 °C (elongation) 

for a total of 3 5 cycles followed by 10 minutes at 72 °C 
(final extension) . 

The amplification products were separated on agarose 
10 gel 1%, the DNA bands of interest were recovered and puri- 
fied with a GeneClean™ kit (Bio 101 Inc., USA, Nr. 1001- 
400) . 

About 100 ng of the DNA thus isolated, for each ampli- 
fication product, was used for the sequence analyses. The 

15 reactions and sequence analyses were carried out with an 
ABI Prism Big TaqDyeDeoxyTerminator Cycle Sequencing kit 
(Applied Biosystems, Nr. 4303149) , using a GeneAmpPCRSys- 
tem9700R as thermocycler (PE AppliedBioSys terns) and, as se- 
quencer, an ABI Prism 3 77 DNA Sequencer (Applied Biosys- 

2 0 terns) . 

The sequence analyses carried out for each gene showed 
complete similarity with the sequences of the genes se- 
lected, thus confirming the efficacy of the method used. 

25 



